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ABSTRACT 



This paper investigates the clustering properties of a complete sample of 1041 24/im- 
selected sources brighter than -F^m = 400^Jy in the overlapping region between the 
SWIRE and UKIDSS UDS surveys. With the help of photometric redshift determina- 
tions we have concentrated on the two interval ranges z — [0.6 — 1.2] (low-z sample) and 
z 1.6 (high-z sample) as it is in these regions were we expect the mid-IR population 
^ s >0 , to be dominated by intense dust-enshrouded activity such as star formation and black 

hole accretion. Investigations of the angular correlation function produce an amplitude 
A ~ 0.010 for the high-z sample and A ~ 0.0055 for the low-z one. The correspond- 
. • ing correlation lengths are r$ = 15.9j^'4 Mpc and tq = 8.5 jz^'l Mpc, showing that 

' the high-z population is more strongly clustered. Comparisons with physical models 

for the formation and evolution of large-scale structure reveal that the high-z sources 
are exclusively associated with very massive (M > 1O 13 M0) haloes, comparable to 
those which locally host groups-to-clusters of galaxies, and are very common within 
such (rare) structures. Conversely, lower-z galaxies are found to reside in smaller halos 
(-^min ~ 10 12 M Q ) and to be very rare in such systems. On the other hand, mid- 
IR photometry shows that the low-z and high-z samples include similar objects and 
probe a similar mixture of AGN and star-forming galaxies. While recent studies have 
determined a strong evolution of the 24/xm luminosity function between z ~ 2 and 
z ~ 0, they cannot provide information on the physical nature of such an evolution. 
Our clustering results instead indicate that this is due to the presence of different 
populations of objects inhabiting different structures, as active systems at z < 1.5 are 
found to be exclusively associated with low-mass galaxies, while very massive sources 
appear to have concluded their active phase before this epoch. Finally, we note that 
the small-scale clustering data seem to require steep (p cx r -3 ) profiles for the dis- 
tribution of galaxies within their halos. This is suggestive of close encounters and/or 
mergers which could strongly favour both AGN and star-formation activity. 

Key words: galaxies: evolution - galaxies: statistics - infrared - cosmology: observa- 
tions - cosmology: theory - large-scale structure of the Universe 



1 INTRODUCTION 

Understanding the assembly history of massive spheroidal 
galaxies is a key issue for galaxy formation models. The 
"naive" expectation from the canonical hierarchical merging 
scenario, that proved to be remarkably successful in explain- 
ing many aspects of large-scale structure formation, is that 



massive galaxies generally form late and over a long period of 
time as the result of many mergers of smaller haloes. On the 
other hand, there is quite extensive evidence that massive 
galaxies may form at high redshifts and on short timescales 
(see, e.g. Cimatti et al. 2004; Fontana et al. 2004; Glazebrook 
et al. 2004; Treu et al. 2005; Saracco et al. 2006; Bundy 
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et al. 2006; McLure et al. 2006), while the sites of active 
star formation shift to lower mass systems at later epochs, 
a pattern referred to as "downsizing" (Cowie et al. 1996; 
Heavens et al. 2004). In order to reconcile the observational 
evidence that stellar populations in large spheroidal galaxies 
are old and essentially coeval (Ellis et al. 1997; Holden et 
al. 2005) with the hierarchical merging scenario, the possi- 
bility of mergers of evolved sub-units ("dry mergers") has 
been introduced (van Dokkum et al. 2005; Naab et al. 2006). 
This mechanism is however strongly disfavoured by studies 
on the evolution of the stellar mass function (Bundy et al. 
2006). 

Key information, complementary to optical/IR data, 
has come from sub-millimeter surveys (Hughes et al. 1998; 
Eales et al. 2000; Scott et al. 2002; Knudsen et al. 2006; 
Coppin et al. 2006) which have found a large population of 
luminous sources at substantial redshifts (Chapman et al. 
2005). However, the interpretation of this class of objects 
is still controversial (e.g. Granato et al. 2004; Kaviani et 
al. 2003; Baugh et al. 2005). The heart of the problem are 
the masses of the objects: a large fraction of present day 
massive galaxies already assembled at z ~ 2 — 3 would be 
extremely challenging for the standard view of a merging- 
driven growth. Measurements of clustering amplitudes are a 
unique tool to estimate halo masses at high z, but complete 
samples comprising at least several hundred of sources are 
necessary. 

Recently, Magliocchetti et al. (2007) have reported ev- 
idence for strong clustering for ~ 800 optically very faint 
(R > 25.5), F24 M m ^ 0.35 mjy sources obtained from the 
Spitzer first cosmological survey (First Look Survey - FLS; 
Fadda et al. 2006). Both the clustering properties and the 
counts of such sources are consistent with them being very 
massive proto-spheroidal galaxies in the process of forming 
most of their stars. Furthermore, by assuming a medium 
redshift z ~ 2, their comoving number density appears to 
be much higher than what expected from most semi-analytic 
models. 

The Magliocchetti et al. (2007) work however suffers 
from the lack of information on the redshift distribution of 
the optically faint Spitzer-selected sources and has to rely 
on models based on both template spectral energy distribu- 
tions and on theoretical investigations of the issue of galaxy 
formation and evolution in order to go from the observed 
projected clustering signal to the more meaningful results 
in real space. 

The optical-to-mid-IR depth of the UKIDSS data allows 
us to overcome this problem. Photometric estimates for the 
overwhelming majority (~ 97%) of 24/^m-selected sources 
are now available (Cirasuolo et al. 2007). Furthermore, de- 
spite the somehow poor statistics, the redshift information 
can also allow us for the first time to compare the clustering 
signal of similar sources at different epochs so to investigate 
possible differences and evolution in their large-scale prop- 
erties. To this aim, we will concentrate on two samples, the 
first one which includes galaxies in the z = [0.6 — 1.2] range, 
and a second one made by sources with z > 1.6. Diagnos- 
tics based on mid-IR photometry indicate that these two 
samples are likely made by a very similar mixture of active 
star- forming galaxies and AGN. 

The layout of the paper is as follows: In § 2 we describe 
the parent catalogue and the sample selection. In §3.1 we 



derive the two point angular correlation function, while in 
§3.2 we present the results for the spatial clustering prop- 
erties of the sources in exam. §4 discusses the implications 
of the clustering results on the source properties, and in 
particular for what concerns their halo masses and number 
density. Our main conclusions are summarized in § 5. 

Throughout this work we adopt a flat cosmology with 
Q, m — 0.3 and Q,a = 0.7, a present-day value of the Hub- 
ble parameter in units of 100 km/s/Mpc h = 0.7, and rms 
density fluctuations within a sphere of 8/i -1 Mpc radius 
(7 8 = 0.8 (Spergel et al. 2003). 



2 SAMPLE SELECTION 

For this work we used the Spitzer wide-area infrared extra- 
galactic (SWIRE) survey (Lonsdale et al. 2003; 2004) to se- 
lect sources with fluxes at 24 /im brighter than 400 fiJy (5a 
completeness) in the XMM-LSS field (Surace et al. 2005). In 
order to obtain multi-wavelength information and accurate 
redshift estimates for these sources we limited our analy- 
sis to the region of the SWIRE survey which overlaps with 
the 0.7 square degrees covered by the UKIDSS Ultra Deep 
Survey (UDS - Lawrence et al. 2006) and Subaru imaging 
(Sekiguchi et al. 2005; Furusawa et al. in preparation). The 

5 overlapping Subaru Suprime-Cam pointings provide broad 
band photometry in the BVRi'z' filters to typical 5a depths 
of B = 27.5, V = 26.7, R = 27.0, i' = 26.8 and z = 25.9 
(within a 2-arcsec diameter aperture) . The UDS is the deep- 
est of the five surveys that constitute the UKIDSS survey 
(Lawrence et al. 2006) and for this work we used J and K- 
band imaging from the first data release with a 5a depth 
of Jab = 23.5 and Kab = 23.5 (Warren et al. 2007). Fi- 
nally, the SWIRE survey also provides Spitzer-IRAC mea- 
surements at 3.6, 4.5, 5.8 and 8/j.m (Surace et al. 2005). 

Within the overlapping region between SWIRE and 
UDS we isolated 1184 sources with i*24 Mm ^ 400^iJy, out 
of which 1041 have a reliable optical and/or near-infrared 
counterpart. We take these objects as our working sample, 
UDS-SWIRE hereafter. As extensively described in Cira- 
suolo et al. (2007), photometric redshifts for these sources 
have been computed by fitting the observed photometry 
(9 broad bands from 0.4 to 4.5^im) with both synthetic 
(Bruzual & Chariot 2003) and empirical (Coleman, Wu 

6 Weedman 1980; Kinney et al. 1996; Mignoli et al. 
2005) galaxy templates, by using the public package HY- 
PERZ (Bolzonella, Miralles & Pello 2000). Comparisons 
with spectroscopic redshifts available in the field show 
the good accuracy of the photometric redshifts with a 
Az/(1 + z) ~ 0.05 over the redshift range < z < 6 
(Cirasuolo et al. 2007). 

The redshift distribution N(z) for our 24/^m sources is 
shown by the dotted line in Figure [1] As it is possible to 
appreciate from the Figure, the objects included in the 
UDS, F24 M m ^ 400^i Jy sample are found up to redshifts 
z ~ 3.5. 

In order to investigate possible evolutionary features of the 
large-scale structure as traced by these objects, we have con- 
centrated on two sub-samples: a first one which includes 
both sources with redshifts above 1.6 and those without an 
optical or K-band identification (hereafter called the high- 
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Figure 1. Photometric redshift distribution of sources in the 
UDS-SWIRE, F 24Mm > 400/iJy sample (thin dotted line). The 
two thick solid lines highlight the redshift intervals covered by the 
high-z (blue) and low-z (red) samples. For comparison, the (green) 
dashed line indicates the redshift distribution of all F24 Mm 
400/^Jy sources fainter than R = 24.5. No redshift cut was per- 
formed on this last sample. 



z sample) and a second one with objects within z = 0.6 
and z = 1.2 (hereafter called the low-z sample). There are a 
number of reasons for the above choice, z ~ 1.6 is in fact the 
redshift at which the strong spectral PAH feature centred at 
7.7/xm - indicative of a very intense star-forming activity - 
enters the 24/xm band. The high-z sample is therefore ex- 
pected to include a relevant fraction of galaxies undergoing 
an intense phase of dust-enshrouded stellar formation. Fur- 
thermore, recent studies have proved that a strong 24/im 
emission combined with a very faint or no optical detection 
most likely originates from obscured star-forming galaxies 
at redshifts between about 1.6 and 2.7 (see e.g. Yan et al. 
2005; 2007; Houck et al. 2005). On the other hand, results 
both based on IRS spectroscopy and on diagnostics of the 
ratio between 8/xm and 24^tm fluxes, have inferred a frac- 
tion of z ~ 2 obscured AGN brighter than our chosen 24/im 
flux limit of about 30%, figure which rapidly increases to 
~ 100% at the highest 24/^m fluxes (see e.g. Brand 2006; 
Magliocchetti et al. 2007) . The counts presented in the right- 
hand panel of Figure 2 seem to confirm this expectation 
(~ 65% candidate starburst galaxies on the basis of their 
^8/im/-F24/jm < 0.1 ratios, with the remaining ~ 35% being 
made of candidate AGN-dominated objects). 

The low-z sample is also expected to probe a substan- 
tial fraction of obscured star-forming galaxies and AGN, al- 
beit with lower intrinsic luminosities since we are dealing 
with a flux-limited survey. The 24pm counts presented in 
the left-hand panel of Figure 2 are remarkably similar to 
those obtained for the high-z sample (~ 60% and ~ 40% 
respectively for candidate starburst galaxies and candidate 
AGN-dominated objects; we note that the lower limit of 
z ~ 0.6 adopted for the low-z sample corresponds to the 



lowest redshift at which a meaningful distinction between 
AGN-powered and SF-dominated sources as based on the 
ratio between i*8 MIn and i ? 24 flm fluxes can be made given the 
spectral properties of intense star-forming galaxies in the 
mid-IR). Furthermore, from recent multi- photometry stud- 
ies which extend from the optical to the X-ray band, it seems 
that z ~ 1 corresponds to the 'bulk' of obscured AGN ac- 
tivity (see e.g. Pozzi et al. 2007). A comparative analysis of 
the two samples would then allow to investigate any redshift 
evolution in the large-scale properties of these sources and, 
if present, find an evolutionary connection between objects 
in the low-z and high-z samples. 



The high-z sample includes 210 sources, 28 of which do 
not have an optical counterpart. The redshift distribution 
of the objects with assigned photo-z closely resembles that 
of R 24.5, 24/im-selected galaxies (cfr. Figure [TJ, showing 
once again that optically-faint sources selected at 24//m most 
likely reside at redshifts z > 1.6. The projected distribution 
onto the sky of the high-z sources is shown by the filled (red) 
circles on the right-hand side of Figure 3. Small dots indicate 
the positions of all F 2 ^m > 400/iJy UDS-SWIRE objects. 
By making use of the redshift distribution shown in Figure [TJ 
and assuming that the N(z) of optically obscured sources 
follows that of galaxies with estimated redshift, we find that 
the average redshift for the above sample is < z >= 2.02, 
its median is z m ed = 1-93, and the mean comoving number 
density of such sources is tig = (2.5 ± 0.3) • 10" 5 Mpc~ 3 , 
number which decreases to nc = (2.2 ±0.3) ■ 10 -5 Mpc -3 if 
one only includes objects with redshifts. The errors associ- 
ated to the above quantities represent the 2a confidence level 
as derived from Poisson statistics. Estimates of the number 
density at z ~ 2 might be very tricky due to joint effect 
of the cosmological evolution of the sources under examina- 
tion and to the large variance of the observed 24/xm SED 
in that redshift range. This is why, for a safety check, we 
have also repeated our calculations of na by simply con- 
sidering a redshift box extending from z — 1.6 to z — 2.5. 
The result is virtually identical to that quoted above both 
by including and excluding galaxies with no redshift infor- 
mation and also very similar to that obtained by extrapo- 
lating the z ~ 2 Caputi et al. (2007; cfr. their Figure 8) Lu- 
minosity Function for 24^m-selected galaxies brighter than 
vL v ~ lO n ' 9 Z/0, where the limiting luminosity has been 
calculated for F24 iaa = 0.4 mjy by following an Arp220-like 
Spectral Energy Distribution, indicative of galaxies under- 
going intense star-formation which we expect (cfr. earlier in 
this section and also Figure 2) to dominate our sample. 



The low-z sample instead contains 350 sources. Their 
sky distribution is represented by the filled (red) circles 
on the left-hand side of Figure 3. The average redshift 
of the sample is found to be < z >= 0.79, its median 
-Zmcd = 0.84, and the comoving number density of these 
sources is nc = (9.9 ± 1.0) ■ 10 -5 Mpc -3 . Even in this case, 
the number density is in excellent agreement with the find- 
ings of Caputi et al. (2007) for their z ~ 1 sources with 
vL v > 10 11,3 Lq, where vL v was calculated as indicated be- 
fore. The most relevant properties for the high-z and low-z 
samples are summarized in Table 2. 



4 Manuela Magliocchetti et al. 



100 



m 
-a 




FluX 24„m [ mJ y] 



100 



CO 

-a 

\ 
Z 

XI 




Flux 



24 /i,m 



[mJy] 



Figure 2. Differential number counts for candidate star-forming galaxies (F8iua/F2&pm < 0.1; green squares) and candidate AGN- 
dominated sources (^8^111/^24^111 ^ 0.1; red circles) in the low-z (left-hand panel) and high-z (right-hand panel) samples. 
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Figure 3. Sky distribution of UDS-SWIRE, -F24 Mm ^ 400/iJy sources (small dots). The filled (red) circles in the left-hand panel illustrate 
the 350 objects included in the redshift interval z = [0.6 — 1.2], while those in the right-hand panel represent the 210 sources with either 
z > 1.6 or no optical or near-IR identification. 



3 CLUSTERING PROPERTIES 



(Hamilton 1993) 



3.1 The Angular Correlation Function 

The angular two-point correlation function w(9) gives the 
excess probability, with respect to a random Poisson distri- 
bution, of finding two sources in the solid angles 8Q1 SH2 
separated by an angle 9. In practice, w(6) is obtained by 
comparing the actual source distribution with a catalogue 
of randomly distributed objects subject to the same mask 
constraints as the real data. We chose to use the estimator 



w{0) = 4 x 



DD_RR 

{DRY 



(1) 



where DD, RR and DR are the number of data-data, 
random-random and data-random pairs separated by a dis- 
tance 9. 

The two considered UDS-SWIRE samples have an es- 
timated 5cr flux completeness Faipm ^ 400/iJy. Further- 
more, since the whole UDS area was uniformly observed by 
Spitzer/MIPS, we only had to mask out those (few) portions 
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Figure 4. Angular correlation function w(8) for the low-z (left-hand panel), and high-z (right-hand panel) samples. The dashed lines 
show the best power-law fits to the data, while the dotted line in the left-hand panel indicates the variation in the best fit obtained by 
considering the w(8) + 1<t values at those scales where w(9) is negative as mere upper limits. The two horizontal (green) dashes in the 
plot for the high-z sample represent the small-scale variation of w(9) derived by either including or excluding the pair of sources whose 
distance from each other is < 5.4" (see text for details). 



of the sky contaminated by the presence of bright stars. Ran- 
dom catalogues covering the whole surveyed area minus the 
bright star regions and with twenty times as many objects 
as the real data sets were then generated for both the high-z 
and low-z samples. w{9) in eq. fTJ was then estimated in 
the angular range 10~ 3 < < 0.3 degrees. The upper limit 
is determined by the geometry of UDS, and corresponds to 
about half of the maximum scale probed by the survey. 

Figure 4 presents our results for the two low-z (left-hand 
panel) and high-z (right-hand panel) samples, while Table 1 
reports the w(8) values as a function of angular scale in 
both cases. The error bars show Poisson estimates for the 
points. Since the distribution is clustered, these estimates 
only provide a lower limit to the uncertainties. However, 
it can be shown that over the considered range of angular 
scales this estimate is close to those obtained from bootstrap 
resampling (e.g. Willumsen, Freudling & Da Costa, 1997). 

Particular attention was devoted to the issue of close 
pairs. In fact, the points on the top left-hand corners of 
both w(9) estimates correspond to angular scales close to 
the 5.4" resolution of the 24^im Spitzer channel and there- 
fore could be affected by source confusion. However, since 
optical and near-infrared images for these sources have a 
much better resolution (~0.8"), we can use this information 
and consider as 'good' pairs all those made of galaxies which 
are identified in the Subaru/UDS images, while we regard 
as potentially spurious those made of galaxies without such 
optical/near-IR counterparts. For the low-z sample, there 
are 11 pairs with distances between 0.001 and 0.003 degrees. 
Of these, only one was found to be closer than 5.4" and both 
IRAC and R-band photometry indicate that we are dealing 
with two distinct sources. 10 close (i.e. again with distances 
between 0.001 and 0.003 degrees) pairs are instead found in 
the high-z sample. Of these, only one is at a distance below 



5.4". Unfortunately, the two sources of this pair are uniden- 
tified both in the optical and in the near-IR bands, and so 
the pair has to be considered as spurious. 

The angular correlation function for the high-z sample 
was then computed by both including and excluding the 
possibly spurious pair. The small-scale (9 ~ 0.0015°) results 
are shown by the two horizontal dashes in the right-hand 
panel of Figure 4. The filled circle represents their average. 
As it is possible to see, the w(6) variation caused by the 
eventual presence of the spurious pair is almost negligible, 
and surely smaller than the error associated to the determi- 
nation of w(ff). Nevertheless, in the following we will use as 
the 'best' small-scale point, that obtained as the average of 
the two estimates and the associated error will be the sum 
in quadrature of the Poisson one and of the variation due to 
the inclusion/exclusion of the pair. 

If we then assume a power-law form for w(8) = A9 \ 
we can estimate the parameters A and 7 by using a least- 
squares fit to the data. Given the large errors on w we choose 
to fix 7 to the standard value 7 = 1.8. Although somewhat 
arbitrary, this figure and its assumed lack of dependence on 
redshift is justified by LSS observations of large enough sam- 
ples of high redshift sources so to allow for a direct estimate 
of the slope of w{9) at different look back times (e.g. Por- 
ciani, Magliocchetti & Norberg 2004; Le Fevre et al. 2005). 
The small area of the UDS survey introduces a negative bias 
through the integral constraint, J w eat dQ = 0. We allow for 
this by fitting to AO 1 ^ - C, where C = 2.54. 
The dashed lines in Figure 4a and 4b represent the best 
power-law fits respectively to the low-z and high-z data. The 
associated best-fit values for the amplitude are A lz = [3.8 ± 
1.6] • 10" 3 for the low-z sample, and A 1 ™ = [10.0 ±3.5] • 10" 3 
for the high-z sample. 

The amplitude for the high-z sample is in good agree- 
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Table 1. Values for the angular correlation function w(9) as ob- 
tained for the low-z and high-z samples. Error bars are obtained 
from Poisson statistics. 

9 [Deg] Low-z sample High-z sample 



0.0015 
0.0045 
0.0075 
0.0150 
0.0450 
0.0750 
0.1500 
0.2500 



1.9±0.9 
0.6 ±0.4 
-0.19 ±0.25 
0.10 ±0.07 
0.07 ±0.04 
-0.004 ± 0.03 
-0.0004 ± 0.01 
-0.01 ±0.01 



3.8 ± 1.7 
0.8 ± 1.8 
0.5 ±0.6 
0.2 ±0.1 
0.05 ± 0.08 
0.08 ±0.05 
0.03 ±0.02 
-0.013 ±0.017 



merit with that obtained by Magliocchetti et al. (2007) by 
analyzing a sample of ~ 800 heavily obscured (R ^ 25.5) 
Spitzer-selected sources from the First Look Survey (FLS, 
Fadda et al. 2006) with i<24/*m ^ 0.35 mjy. This provides us 
with some confidence that cosmic variance is not a cause for 
main concern in our analysis of high-z UDS-SWIRE sources. 
Furthermore, as already noticed by the above authors, in 
this case A is about four times higher than that derived by 
Fang et al. (2004) for a sample of IRAC galaxies selected 
at 8/im (A ~ 2.34 ■ 10~ 3 ), and about ten times higher than 
that obtained by Magliocchetti et al. (2007) for the whole 
F 2 4 M m > 0.35 mjy FLS dataset (A = [9 ± 2] ■ 10" 4 ). 

The case for the low-z sample is more tricky to deal 
with. The inferred value for the amplitude of its angular cor- 
relation function in fact suggests that UDS-SWIRE sources 
at z ~ 1 are sensibly less clustered (about a factor three 
in angular signal) than higher redshift ones. However, the 
computed w(9) for the low-z sample presents puzzling neg- 
ative values in four 9 bins which should instead correspond 
to w(9) > 0. We have re-analyzed the sample a number of 
times by both varying the bin size and by adopting differ- 
ent prescriptions and dimensions for the random catalogue 
in the calculation of w(9) and we can conclude that the ob- 
served negative values for w(9) axe indeed a real feature of 
the z ~ 1 sample. However, we cannot quantify how much of 
this trend is affected by poor statistics and/or cosmic vari- 
ance (for instance, Gilli et al. 2007 find a variation in the 
number of z ~ 1, 24^im-selected sources brighter than ap- 
proximately our chosen limit of almost a factor 2 between the 
two GOODS fields set in the northern and in the southern 
sky). While we note that also the Gilli et al. (2007) correla- 
tion function as estimated at z ~ 0.7 for the GOODS South 
exhibits a dip at scales ~ 0.2/ h Mpc which - for the cho- 
sen cosmology - correspond to the 9 ~ 0.008° bin where we 
find a negative value for w(8), nevertheless caution has to be 
used when dealing with the four negative w(9) figures. This 
is why we have decided to lower the weight of these points 
in our \ 2 analysis by considering the estimated w(9) + la 
values (see Table 1) as mere upper limits. The resulting am- 
plitude of the angular correlation function for low-z UDS- 
SWIRE galaxies in this case is A lz = 5.5±2.o ' 10" 3 , about 
40 per cent higher than what previously found by includ- 
ing measured data points for w(9) at all scales. This is the 
value which we will adopt throughout the paper and which 
- though allowing for the large uncertainties - still results 
to be a factor ~2 lower than that derived from the high-z 
sample. We will investigate the implications of this result in 
the following sections. 



3.2 Relation to spatial quantities 

The standard way of relating the angular two-point correla- 
tion function w(9) to the spatial two-point correlation func- 
tion £(r, z) is by means of the relativistic Limber equation 
(Peebles, 1980): 



,(0) = 2 



Jo°° JT F- 2 (x)x 4 $ 2 (x)£(r, z)dx du 
L/T F- 1 {x)x 2 ^{x)dx\ 2 



(2) 



where x is the comoving coordinate, F{x) gives the correc- 
tion for curvature, and the selection function $(:r) satisfies 
the relation 



^(x)F~ 1 {x)x 2 dx - 



«7 



N(z)dz, 



(3) 



in which J\T is the mean surface density on a surface of solid 
angle Q s and N(z) is the number of objects in the given sur- 
vey within the shell (z, z + dz). If we adopt a spatial correla- 
tion function of the form £(r, z) — (r/ro) -1 ' 8 , independent of 
redshift in the considered intervals, and we consider the red- 
shift distributions presented in Figure [T] for the adopted cos- 
mology we obtain = 15.91^ Mpc and r z = 8.5±^;| Mpc 
(where both quantities are comoving and this latter value 
has been calculated for an amplitude A — 0.005 found by 
following the method explained in §3.1), respectively for the 
high-z and low-z sample. In order to test for the constancy 
of ro in the somehow wide redshift range z = [1.6 — 3.5], we 
have also computed w(9) and ro by using a subsample of 178 
high-z galaxies within the narrower interval z = [1.6 — 2.4]. 
The results for both A = 0.011 ±0.004 and r ~ 15 Mpc are 
virtually identical to those reported here and in §3.1. 

Even though allowing for the large uncertainties, the 
above figures indicate that the high-z sample is more 
strongly clustered than the low-z one. In fact, despite the 
large error-bars, the inferred correlation lengths are incom- 
patible with each other at the ~ 3cr confidence level. The 
ro value for the high-z sample perfectly matches that of 
~ 15 Mpc obtained by Magliocchetti et al. (2007). This is 
also in agreement with the estimates obtained in the case 
of ultra- luminous infrared galaxies over 1.5 < z < 3 (Far- 
rah et al. 2006a,b). Spitzer-selected galaxies found in the 
range 1.5 < z < 3 thus appear to be amongst the most 
strongly clustered sources in the Universe. Similar values 
have been recently obtained by Foucaud et al. (2007) for 
their z ~ [1 - 2] UKIDSS-UDS sample of distant red galax- 
ies (DRG). Although with larger uncertainties, Grazian et 
al. (2006) also report a correlation length of the order of 
13/ft Mpc for their DRG, 2 < z < 3 dataset. A very high 
clustering length was also found by Magliocchetti & Maddox 
(1999) in their statistical analysis of high redshift galaxies 
in the Hubble Deep Field North. All these values point to an 
evolutionary connection between galaxies undergoing an in- 
tense star-formation activity such as those probed by Spitzer 
and the HDF beyond z ~ 2 and older sources such as distant 
red galaxies. According to this picture, these two classes of 
objects would merely correspond to different stages in the 
evolution of a very massive galaxy. We note that locally, the 
clustering properties of z ~ 2 active galaxies find a coun- 
terpart only in those exhibited by radio sources (see e.g. 
Magliocchetti et al. 2004) and early type L > 4 L* galaxies 
(see e.g. Norberg et al. 2002) and are only second to those 
of rich clusters of galaxies (e.g. Guzzo et al. 2000). It is then 
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Table 2. Properties of the two i*400ftm 400 mjy samples considered in this work. From left to right the columns show the average 
redshift < z >, the number of sources, the comoving number density ng (in [10 — 5 Mpc -3 ] units), the amplitude A of the angular 
correlation function, the comoving clustering length ro (in Mpc units), the minimum mass AiffiF^ (in Mq units) for the dark matter 
halos, normalization N^ FW and slope of the HON o NFW - all three estimated under the assumption of a NFW profile for the distribution 
of galaxies within their dark matter halos — , the same quantities evaluated under the assumption of a steeper p oc r~ 3 (PL3) profile, and 
the large-scale bias b. 

< z > N n G A- 10" 3 r LogMNfW LogNg 1 ™ a NFW LogM^ 3 LogN PL3 a PL3 bias 

2.02 210 2.5 ±0.3 10.0 ±3.5 15.9±3.4 W.stpj ~°- 7 -Q.S °- s -o.l 12 - 8 ^o.2 -°- 7 io.5 °- 7 ta5 6 - 17 
0.79 350 9.9 ±1.0 5.5+2 Q 8.5+^g H-9±as - 2 -°-o.9 °- 8 -o.2 n - 8 -o.8 ~ 2 - 1 -o.9 °- 7 ^a3 L70 



natural trying to envisage a connection between these latter 
objects with the high-z ones, whereby massive galaxies un- 
dergoing intense star-formation at redshifts z ~ 2 end up as 
the very bright central galaxies (passive objects with a high 
probability for enhanced radio activity, see e.g. Best et al. 
2007; Magliocchetti & Bruggen 2007) of local clusters. 

On the other hand, Spitzer-selected sources residing at 
z ~ 1 are less strongly clustered. Interestingly enough, both 
Magliocchetti & Maddox (1999) and Grazian et al. (2006) 
find similar results for their 1 < z < 2 HDF and DRG 
samples. Our results for the low-z sample also closely re- 
semble those recently obtained by Gilli et al. (2007) who 
find a correlation length ro ~ 8 Mpc for their joint sample 
of 0.5 < z < 1, Lih > 10 11 Lq galaxies selected at 24/im in 
the two GOODS fields. 

Locally, the clustering properties of mid-IR bright sources 
mirror those of 'normal' early-type galaxies (e.g. Madgwick 
et al. 2003; Zehavi et al. 2005). 



4 CONNECTION WITH PHYSICAL 
PROPERTIES 

A closer look at Figures 4a and 4b shows that a sim- 
ple power-law cannot provide a good fit to the measured 
w(0) below 6 < 0.003°. This small-scale steepening is inti- 
mately related to the way the sources under consideration 
occupy their dark matter haloes, an issue which can be dealt 
with within the so-called Halo Occupation Scenario. For de- 
tails, we refer the interested reader to the work of Porciani, 
Magliocchetti & Norberg (2004). Basically, the halo occu- 
pation framework relates the clustering properties of a cho- 
sen population of extragalactic objects with the way such 
objects populate their dark matter haloes. Within this sce- 
nario, the two-point correlation function £ can be written 
as the sum of two components, f and £ 2h , where the first 
quantity accounts for pairs of galaxies residing within the 
same halo, while the second one takes into account the con- 
tribution to the correlation function of galaxies belonging 
to different haloes. The quantities £ lh and £, 2h depend on 
a number of factors, amongst which cosmology, spatial dis- 
tribution of sources within their haloes, mass function of 
the dark matter halos, dark matter auto-correlation func- 
tion, large-scale bias, and also on the first {N) and second a 
moments of the halo occupation distribution Pn(M) which 
gives the probability of finding N galaxies within a single 
halo as a function of the halo mass M. On the other hand, for 
a given cosmology, given the halo mass function n(M) (num- 
ber density of dark matter haloes per unit comoving volume 
and log 10 (M)), the first moment of the halo occupation dis- 



tribution (AT) (hereafter called halo occupation number or 
HON) completely determines the mean predicted comoving 
number density n of galaxies in the desired redshift range: 

n= J n(M) (N(M)) dM . (4) 

Any consistent picture within the halo occupation scenario 
has to be able to simultaneously reproduce at least the first 
(mean number density) and second (two-point correlation 
function) moment of the observed galaxy distribution. 

One common way to parametrize the HON and variance 
of the galaxy distribution is (Porciani et al. 2004; see also 
Hatton et al. 2003): 

N(M \ - / N o(M/M min ) a if M > M min 
1 ' \ ifM<M min ' 

a(M) = f3{M) 2 N(M), (5) 

where /3(M) = 0,log(M/M min )/log(M/M ), 1, respectively 
for N(M) = 0, N(M) < 1 and N(M) > 1. The operational 
definition of Mo is such that N(Mo) = 1, while M m i n is 
the minimum mass of a halo able to host a source of the 
kind under consideration. More and more massive haloes 
are expected to host more and more galaxies, justifying the 
assumption of a power-law shape for the halo occupation 
number. As for the variance a(M), we note that the high- 
mass value for /3(M) simply reflects the Poisson statistics, 
while the functional form at intermediate masses (chosen to 
fit the results from semi-analytical models and hydrodynam- 
ical simulations, see e.g. Sheth & Diaferio 2001 and Berlind 
et al. 2003) describes the (strongly) sub-Poissonian regime. 

In more operative terms, we have calculated £ lh and £ 2,1 
using the Sheth & Tormen (1999) prescriptions for the halo 
mass function and the large-scale bias, while the mass auto- 
correlation function was computed by following a revised 
version of the method of Peacock & Dodds (1996) which 
takes into account the spatial exclusion between haloes (i.e. 
two haloes cannot occupy the same volume). As a starting 
working hypothesis, the radial profile of the galaxy distri- 
bution within their halos is assumed to follow that of the 
dark matter for which we adopt a NFW profile (Navarro, 
Frenk & White, 1997). Finally, we allowed the parameters 
in eq. (JSJ to vary within the following ranges: 
< a sS 2; 10 11 Af m in/M Q < 10 14 ; -3 < log 10 (AT ) < 0. 

Values for these three parameters have been then deter- 
mined through a minimum \ 2 technique by simultaneously 
fitting the observed w(0) (where for the low-z sample we 
have treated the negative w(0) values as in §3.1) and the 
estimated number density no of sources in both the low-z 
and high-z samples (cfr. Table 2). The angular correlation 
function was computed from eq. ([2]), with £ obtained as ex- 
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Figure 5. Angular correlation function w(9) for the low-z (left-hand panel), and high-z (right-hand panel) samples. The solid (blue) 
curves represent the best HON fits to the data obtained under the assumption that the distribution of galaxies within their dark matter 
haloes mirrors that of the dark matter (chosen to be described by a NFW profile, cfr. Section 4), while the dashed (green) curves are 
the best fits obtained for a steeper galaxy profile, p oc r~ 3 . The long-short dashed (cyan) curves correspond to the best 1-parameter fits 
obtained by setting in eq. (5) a = and No = 1. The negative w (9) data-points in the \ 2 analysis of the low-z sample have been treated 
as in §3.1 (see text for details). 



plained above and the redshift distributions shown in Figure 
1. The best-fit parameters for the HON of both the low-z and 
high-z sample are summarized in Table 2; the quoted errors 
correspond to A\ 2 = 1. The uncertainties associated with 
a and especially to Mmin are much smaller than those cor- 
responding to the normalization parameter No as while the 
first two quantities are constrained by both the large-scale 
and small-scale behaviour of £, according to our model No 
only enters the description of the spatial two-point correla- 
tion function in the sub-halo regime (cfr. eqs 13 and 14 of 
Porciani et al. 2004). 

The theoretical angular correlation functions corre- 
sponding to the best-fit HON parameters for both the low-z 
and high-z sample are represented by the solid (blue) curves 
in Figure 5. Since the high-z sample includes a fraction of 
sources without estimated redshifts, we have also re-run the 
HON method by only including in the calculation of the co- 
moving number density no sources with assigned photo-z. 
The results are virtually identical to those reported in Ta- 
ble 2. 

As it is possible to notice from Figure 5, the HON does a 
good job at reproducing the observed w(8) at intermediate- 
to-large angular scales both in the case of the low-z sample 
and for the high-z sample. However, as it is particularly ev- 
ident in the low-z plot, it cannot successfully describe the 
steep rise of the measured two-point correlation function 
below ~ 0.005 degrees. The reason is quite easy to under- 
stand: strong contributions to the 1-halo correlation func- 
tion £ 1 (i.e. the portion of cj which determines its behaviour 
on scales smaller than the virial radius) are mainly due to 
high values of a and No. The magnitude of a, together with 
that of Af m i n , is however also determined by the large-scale 
behaviour of w(6). Furthermore, all three quantities enter 



the calculation of the predicted number density n via equa- 
tion ([4]) and the number of possible values and combina- 
tions is strictly ruled by the requirement that the predicted 
n matches (within 2a in our case) the observed, nc, one. It 
follows that the best-fit curves plotted in Figures 5a and 5b 
below 6 ~ 0.03° in the low-z case and 9 ~ 0.01° for the high- 
z sample represent the largest possible small-scale contribu- 
tions to the observed w(0) subject to the constraints put by 
the large-scale behaviour of w(0) and by the observed num- 
ber density of the sources that produce the clustering signal. 
Furthermore, we note that large values for a and No, as well 
as more extreme choices for the second moment a(M) of the 
galaxy distribution in (5), would only determine a boost of 
w(0) on the smallest angular scales probed by our analysis 
(i.e. an approximately vertical shift of the projected contri- 
bution of £ ), but cannot radically change its shape, which 
shows a smooth rise followed by a plateau in net contrast 
with the steep jump of the observed data points on scales 
smaller than at least 6 ~ 0.003 degrees. The model as it is 
cannot do any better than this. 

The most likely solution to this 'small-scale crisis' comes 
from releasing the assumption that the distribution of galax- 
ies within their halos follows that of the dark matter. This 
working hypothesis was used both for simplicity and also 
because locally the halo distribution of both late-type and 
early-type galaxies can be nicely described by a NFW profile 
(see e.g. Magliocchetti & Porciani 2003) . However, this does 
not have to be necessarily true at high redshifts. As an alter- 
native approach, we have then considered the possibility for 
the galaxies to be more concentrated than the underlying 
dark matter, and in order to reproduce this effect we have 
taken a galaxy density profile of the kind p oc r~ 3 (hereafter 
called PL3; see Magliocchetti & Porciani 2003). 
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The best fitting curves obtained within the HON frame- 
work in the PL3 case are shown in Figures 5a and 5b by the 
dashed (green) curves. The corresponding best-fitting val- 
ues for M m i n , a and Ao are given in Tabie 2. As expected, a 
model which now foresees a steep profile for the galaxy dis- 
tribution can provide an excellent fit to the observed small- 
scale rise of w(8) in both the low-z and high-z sample. Fur- 
thermore, the best fitting values for the HON in the PL3 
case are almost identical to those obtained by considering 
NFW profiles. This shows that the average behaviour for 
the spatial distribution of galaxies within their dark matter 
halos can be recovered from investigations of the small-scale 
features of the angular/spatial two-point correlation func- 
tion in a 'clean' way as such features can be modelled inde- 
pendently of the other parameters adopted to describe the 
clustering properties of a given astrophysical population. 

So, which kind of information can we infer from this 
small-scale behaviour of w(6)? Locally, Magliocchetti & Por- 
ciani (2003) find that galaxy distributions of the PL3 kind 
are strongly rejected by the data, as small-scale measure- 
ments of the two-point correlation function prefer smoother 
profiles such as NFW or p oc r -2 ' 5 . However, at very low 
redshifts it is reasonable to expect the majority of dark 
matter halos to be virialized and therefore be to reason- 
ably described (at least beyond the very small scales, see 
e.g. Bukert 1995) by NFW-like profiles. Two options are in- 
stead possible at such high redshifts. The first one is that 
galaxies still do trace the underlying distribution of the dark 
matter within their halos, and it is the dark matter itself 
which does not follow NFW-like profiles. The second option 
is instead that galaxies are simply more radially concen- 
trated than the dominant dark matter counterpart. What- 
ever the favourite choice, one thing is clear. On sub-halo 
scales, 24/im-selected galaxies at redshifts z > 0.6 are more 
strongly radially concentrated than their low-redshift coun- 
terparts. Such a strong concentration is indicative of a strong 
interaction between galaxies which reside in the proximity of 
the halo centres, and we expect such a strong interaction to 
eventually drive a substantial number of mergers. It is pos- 
sibly not a random coincidence that bright 24/im-selected 
sources at intermediate-to-high redshifts are in general as- 
sociated to very intense star-forming systems; the excess of 
proximity between such galaxies would eventually lead to 
gas-rich mergers which could easily trigger enhanced star- 
formation activity like the ones observed. In passing, we 
note that evidence for an upturn (although less significative 
than ours) on scales r < 0.2/h Mpc has also been recently 
reported by Gilli et al. (2007) for their sample of z ~ 1, 
24/im-selected sources in the GOODS fields. 

The best average number of galaxies (AT) as a function 
of halo mass for the two analysed samples (and in the PL3 
case) are presented in Figure 6 (solid/red line for the high-z 
sample), while the galaxy mass function as coming from eq. 
(4) for the best-fit values of the HON for the high-z sam- 
ple (solid/red curve) and in the case of the low-z sample 
(dashed/green curve) are illustrated by Figure 7. Despite 
the large uncertainties due to the relatively poor statistics, 
a number of interesting features can be gathered from these 
plots. Sources belonging to the high-z sample appear to be 
exclusively located in very massive, M > 1O 13 M0, struc- 
tures, identifiable with groups-to-clusters of galaxies. These 
galaxies are quite common within their halos ((N) between 
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Figure 6. Average number of galaxies (TV) per dark matter halo 
of specified mass M (expressed in Mq units). The solid (red) 
line represents the case for the high-z sample, while the dashed 
(green) line is for the low-z sample. 
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Figure 7. Number of galaxies per unit of (log) dark matter mass 
and volume as evaluated following eq. (4) for the best fitting values 
of the HON presented in Figure 6 at the effective redshifts z ~ 
0.8 and z ~ 2 of the two datasets under exam. The solid (red) 
line represents the result for the high-z sample, while the dashed 
(green) line is for low-z sample. For comparison, the two dotted 
lines indicate the Sheth &: Tormen (1999) mass function n(M) of 
dark matter haloes (the higher/green curve corresponding to the 
low-z sample). The filled triangle indicates the observed number 
density of sources in the high-z sample, while the filled square is 
for the low-z sample. 
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Figure 8. Angular two-point correlation function w(8) for low-z 
UDS-SWIRE sources brighter than 0.5 mjy (210 objects; green 
squares and associated, dotted, errorbars) as compared to that 
presented in §3.1 (red circles). The dashed line represents the 
best-fit to the low-z, i<24/jm ^ 0.4 mjy, sample obtained as in 
§3.1 



~ 0.6 and ~ 0.06 at the lowest possible masses where the 
sources start appearing), and we note that since (N) is a 
quantity which is averaged all over the dark matter halos 
more massive than M m i n , in the most extreme case our re- 
sults imply that more than one in two of all the structures 
with masses M ~ 10 13 M Q host a high-z, -F24/jm 400/iJy 
galaxy. Furthermore, their number sensibly increases with 
the richness of the structure which hosts them. At around 
M ~ 10 15 M o , their average abundance is between 5 and 20 
sources per halo (~ 10 for the best-fit model), value which is 
not in disagreement with the typical figures for the number 
of bright, L > L* galaxies associated to local rich clusters 
(e.g. Terlevich, Caldwell & Bower, 2001). 

On the other hand, the low-z sample seems to be made 
of galaxies of much smaller mass (M min ~ 10 12 M Q ). Also, 
these objects are very rare (in the most unfavourable case we 
get (N) ~ 10 -3 at the lowest possible masses at which the 
sources start appearing, i.e. only 0.1% of the 'allowed' halos 
host a galaxy of the kind which make the low-z sample), 
even though their number - as it was the case for the high-z 
sample - sensibly increases with halo mass; this determines 
an overall flatness of the galaxy mass function at least up to 
~ 1O 14 M . 

From the analyses performed in Sections 3 and 4 it appears 
clear that the populations probed by the high-z and low- 
z samples are very different from each other. One might 
argue that the higher clustering level reported at redshifts 
2 > 1.6 can be simply attributed to the fact that since we 
are dealing with a flux-limited survey rather than a volume- 
limited one, higher redshifts probe more intrinsically lumi- 
nous sources which are in general hosted by more massive 
halos and therefore result more strongly clustered. However, 
we note that this cannot be the only explanation for such 



a strong difference between the clustering properties of the 
low-z and the high-z sample. Sources in the high-z sample 
are in fact so strongly clustered that it would only take a 
small fraction of them in the low-z sample (say of the or- 
der of 20%) to sensibly boost the low-z clustering signal. 
This does not seem to happen. Obviously, the ultimate evi- 
dence for a strong evolution of the clustering signal of 24/^m- 
selected sources between z ~ 2 and z ~ 1 could in principle 
only be obtained by comparing volume-limited rather than 
flux-limited samples. Given the Spectral Energy Distribu- 
tions of these sources, this would imply including in our 
analysis only objects brighter than ~ 1 mjy in the low-z 
sample. Unfortunately, the number of such low-z sources is 
very limited and insufficient to allow for a statistically sig- 
nificant estimate of w(9). However, as a sensible alternative 
we can consider a sub-sample of low-z sources brighter than 
the chosen 0.4 mjy limit, which contains enough objects 
to provide meaningful results for the two-point correlation 
function. We have then decided to restrict our analysis to 
low-z galaxies brighter than = 0.5 mjy. We note that, 

despite the marginal flux increment, this new sample only 
contains 210 sources, i.e. almost a factor 2 less than the orig- 
inal, Fiifim ^ 0.4 mjy, dataset. The corresponding w(8) is 
shown by the (green) squares in Figure 8. 
Under the assumption of no cosmological evolution for those 
24/im-selected galaxies residing at z ~ 2 and taking into ac- 
count the different volumes occupied by the two low-z and 
high-z samples, we would expect ~ 80 very massive sources 
in the 0.6 ^ z ^ 1.2 redshift interval. And if these sources 
indeed existed at z ~ 1, they would constitute ~ 40% of the 
F^Afim ^ 0.5 mjy low-z sample, to be compared to ~ 25% of 
the i*24/«n 1? 0.4 mjy one. This sensible increase in the rela- 
tive weight of very massive sources (which now would make 
only slightly less than 50% of the entire low-z dataset) in the 
z ~ 1 sample is expected to determine a sensible boost of 
the correlation function signal. But this is does not happen, 
as the observed clustering amplitude of FiA^m 0.5 mjy 
objects is virtually identical to that obtained in §3.1 (and 
reproduced in Figure 8 by the red circles for sake of clarity) 
for a lower flux cut. 

Massive galaxies undergoing intense activity such as 
star-formation or AGN accretion seem to disappear when 
going from redshifts ~ 2 to ~ 1. At this lower redshifts, 
obscured star forming and AGN activity seem to be segre- 
gated in much lower-mass systems. Obviously, the present 
data cannot say anything on the eventual presence of active 
low-mass galaxies at redshift ~ 2, but they tend to strongly 
exclude the presence of intense activity from very massive 
systems below redshifts - say - 1.5. 

4.1 The Halo Bias Approach 

One might wonder about the need of using somewhat so- 
phisticated tools such as the Halo Occupation formalism in 
the case of datasets characterized by a low signal-to-noise 
ratio as the ones we are dealing with. To answer this ques- 
tion, we have considered the more standard halo bias ap- 
proach (Mo & White 1996) which describes the clustering 
signal of a chosen population of extragalactic sources as the 
product between the two-point correlation function of the 
dark matter and the square of the so-called bias function, a 
quantity which solely depends on the minimum mass M m i n 
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of the halos in which the astrophysical objects reside. Such 
an approach assumes a one-to-one correspondence between 
dark matter halos and hosted sources and can be thought 
as a special case within the HON framework corresponding 
in equation (5) to a = 0, iVo = 1 and a(M) = 0. Once 
the cosmology is fixed, this is a 1-parameter model which 
might seem more appropriate for the description of cluster- 
ing signals affected by large noise. The angular two-point 
correlation functions w(9) predicted by the halo bias model 
for the low-z and high-z samples have then been computed 
once again by following eq.(2) and the best values for M m i n 
in the two cases have been found by a ^-Ht to the ob- 
served clustering signal. We find LogM m i n = 13.3lg'4 an d 
LogM m i n = 12.4^5J'g respectively for the high-z and low- 
z sample, and the corresponding w(0)'s are represented in 
Figures 5a and 5b by the (cyan) long-short dashed curves. 
These values are in good agreement with those obtained by 
using the full HON approach presented earlier in this Sec- 
tion, the somewhat higher figures found for M m i n in this 
latter case being simply due to the fact that a and M m i n 
are covariant quantities, so that within the HON framework 
higher a in general correspond to lower M m i n . This provides 
reassuring evidence for the goodness of the results derived 
from the HON analysis. 

On the other hand, the 1-parameter halo bias model 
suffers from a number of problems which can instead be 
overcome by using the full HON approach. The first prob- 
lem is encountered once the values of M m -, n as obtained 
above are plugged in the calculation of the number den- 
sity of sources via eq. (4). In the high-z case, the figure we 
obtain (n ~ 1.5- 1CP 5 Mpc~ 3 ) is in good agreement with the 
observational findings presented in §2. However, for the low- 
z sample we find n ~ 8.5 ■ 1CP 3 Mpc~ 3 , value which is about 
10 times larger that what observationally determined. This 
big discrepancy stems from the erroneous a priori assump- 
tion of the 1-parameter halo bias model for a one-to-one 
correspondence between dark matter halos and astrophysi- 
cal sources. As shown by the results of the full HON analysis 
(cfr. Table 2), such a working hypothesis could hold for the 
high-z sample which presents No values not too far from 
unity, but badly fails when one deals with rare sources such 
a those characterizing the low-z sample. The HON approach 
instead provides a self-consistent answer for both the clus- 
tering properties and the number density of z ~ 1 sources. 

A second problem related to 1-parameter models such 
as the halo bias is that, by merely concentrating on the 
clustering properties of the dark matter halos, they are un- 
able to describe the clustering behaviour of astrophysical 
sources on sub-halo scales. This feature is evident in Fig- 
ures 5a and 5b. Although reproducing the large-scale trend 
of the observed w(9)'s on large scales, the halo bias model 
systematically underestimates the observed clustering sig- 
nal at small angular distances. The problem is particularly 
bad at the smallest angular scales probed by our analysis 
whereby, in the case of z ~ 2 sources, this model predicts 
~ 3 pairs to be compared to the 9 (or 10, see §2) observed 
in our dataset. For the lower redshift sample the predicted 
figure is again ~ 3, while the number of observed pairs is 11. 
The full HON approach instead works much better at de- 
scribing the clustering properties of extragalactic sources at 
all scales and can provide a correct match between observed 
and predicted quantities even at the shortest distances (the 



number of predicted pairs at 9 ~ 0.0015 degrees within the 
HON framework is ~ 5 and ~ 9 respectively for a NFW 
and a PL3 model at z ~ 2 and ~ 4 and ~ 11 at z ~ 1). 
We note, for comparison, that the '7 model' exploited in §3, 
gives N iow - z ~ 6 and N hig h-z ~ 5. 



5 CONCLUSIONS 

We have investigated the clustering properties of 24pim- 
selected galaxies brighter than 400piJy, drawn from the 
SWIRE and UKIDSS Ultra Deep Surveys. With the help of 
photometric redshift determinations, we have concentrated 
on two datasets which include galaxies respectively in the 
z — [0.6 — 1.2] (low-z sample) and 2 > 1.6 (high-z sample) 
redshift ranges. The low-z sample is made of 350 galaxies, 
while the high-z sample includes both 182 galaxies with es- 
timated photo-z's and another 28 sources which do not have 
any optical or near-IR counterpart. Diagnostics based on 
the ratio between 8/xm and 24/jm fluxes indicate that these 
two samples are likely made by a very similar mixture of 
active star-forming galaxies (~ 65 per cent) and AGN (the 
remaining ~ 35 per cent). 

Results obtained by fitting with a power-law of fixed 
slope 7 = 1.8 the observed angular two-point correlation 
function w(9) report an amplitude A hz = 0.010 ± 0.0035 
and A lz = 0.0055lo;°o20 ( w ( e ) = A respectively for 

the high-z and low-z samples. In more physical coordinates, 
the above results imply (comoving) correlation lengths for 
the spatial two-point correlation function £ (modelled as 
t = (r/r )-<) 4 Z = 15.9±H Mpc and r l z = 8.5±^;| Mpc, 
showing that the galaxies in the high-z sample are more 
strongly clustered that those found at lower redshifts. 

A deeper insight on the above findings is provided by 
the so-called Halo Occupation Scenario which connects the 
clustering properties of a chosen population of astrophysi- 
cal objects with some of their physical properties. The key 
quantity for this kind of analysis is the Halo Occupation 
Number (HON), i.e. the average number of galaxies enclosed 
in a halo of given mass M, which can be parametrized as 
(N) = No (M/Mmin)™, with Mmin minimum mass for a halo 
to host one of such galaxies and No normalization factor 
which provides information on how common the galaxies 
under examination are. 

At first we have adopted as a working hypothesis that 
Spitzer-selected galaxies were spatially distributed within 
their dark matter halos according to the dark matter (NFW) 
distribution. However, we find that such a model fails at re- 
producing the high amplitude of the observed w(9) signal 
on angular scales < 0.003 degrees both in the low-z and 
high-z sample. A much better agreement with the data is 
obtained when we take the galaxies to be very concentrated 
towards their halo centres, e.g. by assuming a distribution of 
the kind p oc r~ 3 . Such a high concentration - not observed 
in local data (Magliocchetti & Porciani 2003) - is sugges- 
tive of a strong interaction and close encounters between 
galaxies which reside in the proximity of the halo centres, 
and we expect such a strong interaction to eventually drive 
a substantial number of gas-rich mergers which could easily 
trigger the observed enhanced star-formation activity. 

Investigations of the best-fit values for the HON report 
values Log(M m f n /M Q ) = 12.81°;*; LogJStf* = -0.7t°,j; 
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a hz _ o.7±°;g for the high-z sample and Log (M^ in /M ) = 
ll.Stgj; LogiV^ = — 2.1±o;|; a lz = 0.7±g;g in the case of 
low-z sources. The above figures indicate that galaxies be- 
longing to the high-z sample are exclusively associated with 
very massive, M > 1O 12 ' 8 M0, structures, identifiable with 
groups-to-clusters of galaxies. Furthermore, these galaxies 
are quite common within their halos (in the most extreme 
case, our results imply that more than one in two of all 
the structures with masses M ~ M m j n host a z ~ 2, 
400/iJy galaxy), and their number sensibly in- 
creases with the richness of the structure which hosts them. 
On the other hand, the low-z sample seems to be made of 
galaxies of much smaller mass (M min ~ 10 U ' 8 M Q ). These 
objects are also very rare; in the most unfavourable case we 
get that only 0.1% of the 'allowed' halos host a galaxy of 
the kind which make the low-z sample. 

Such a remarkable difference in the clustering and envi- 
ronmental properties of active sources as seen at z ~ 2 and 
z ~ 1 can be hardly attributed to the fact that high red- 
shift galaxies are intrinsically brighter than their lower red- 
shift counterparts (cfr. §4). Indeed, despite the large uncer- 
tainties determined by the poor statistics of the considered 
datasets, our results indicate that the populations probed 
by the high-z and low-z samples are very different from each 
other: massive active galaxies seem to disappear when going 
from z ~ 2 to z ~ 1, and at this lower redshifts all the AGN 
and starforming activity appears to be segregated in much 
lower-mass systems (see also the results of Gilli et al. 2007 
who find for their bright, z ~ 0.7, 24/^m-selected galaxies 
in the GOODS fields a correlation length ro ~ 8 Mpc). We 
stress that, while investigations of the luminosity function of 
8/im-rest frame-selected sources have already established a 
strong luminosity evolution of sources between redshifts ~ 2 
and ~ 1 (e.g. Caputi et al. 2007), clustering measurements 
provide a unique tool to determine the physical nature of 
such an evolution. Our work can in fact show that the strong 
differential evolution of the 24/^m LF is not simply due to 
the fact that the less luminous sources at z ~ 1 are dimmed 
versions of the galaxies at higher-z (i.e. pseudo-passive evo- 
lution), but indeed has to be attributed to different popula- 
tions of objects inhabiting different dark matter haloes and 
structures. 

The general picture which then emerges from the results 
of this paper points to a differential evolution for high-mass 
and low-mass systems. Massive systems form early in time 
at the rare peaks of the density field. Their intense activ- 
ity, both in terms of star-formation and AGN accretion, is 
strongly favoured (if not triggered) by numerous close en- 
counters and gas-rich mergers happening in the proximity 
of the centres of the massive/cluster-like structures in which 
these sources reside. Such an active phase is relatively short- 
lasted and already below z ~ 1.5 these objects evolve as 
optically passive galaxies, in most of the cases ending up 
as the supermassive galaxies which locally reside in cluster 
centres and which show a remarkable tendency for enhanced 
radio activity (see e.g. Best et al. 2006; Magliocchetti & 
Briiggen 2007). Lower (M < 1O 12 M0) mass systems are in- 
stead characterized by an active phase which lasts down to 
lower redshifts. Also in this case, the AGN and star forming 
activities seem to be favoured by the strong concentration 
of these sources towards their halo centres. These objects 



will eventually end up as 'typical' early-type galaxies which 
favour relatively (but not extremely) dense environments. 

At present, studies on the evolutionary properties of 
dust-enshrouded active systems like those we have investi- 
gated in this work are hampered by the lack of surveys which 
can probe their peak of emissivity to low enough fluxes and 
also by the extreme difficulty of getting spectroscopic red- 
shifts for such optically faint sources. The forthcoming ad- 
vent of instruments such as Herschel, SCUBA-2 and ALMA 
will finally fill in this gap and provide the community with 
'the ultimate truth' on most of the issues connected with 
galaxy formation and evolution. 
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